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Abstract 

Behaviors of Winfree's tile assembly systems (TASs) at high temper- 
atures are investigated in combination with integer programming of a 
specific form called threshold programming. First, we propose a way to 
build bridges from the Boolean satisfiability problem (Sat) to threshold 
programming, and further to TAS's behavior, in order to prove the NP- 
hardness of optimizing temperatures of TASs that behave in a way given 
as input. These bridges will take us further to two important results on 
the behavior of TASs at high temperatures. The first says that arbitrarily 
high temperatures are required to assemble some shape by a TAS of "rea- 
sonable" size. The second is that for any temperature r > 4 given as a 
parameter, it is NP-hard to find the minimum size TAS that self-assembles 
a given shape and works at a temperature below r. 

1 Introduction 

The abstract Tile Assembly Model (aTAM) , which has been introduced by Win- 
free P3] based on a dynamic version of Wang tiling [T3], is a model of "pro- 
grammable crystal growth" with algorithmically-rich theoretical background 
and results. Self-assembling systems in this model are called tile assembly 
systems (TASs). Self-assembling (square) tiles have been experimentally im- 
plemented as DNA double-crossover molecules in 1998 jT5] , which are designed 
so ingeniously that they bind deterministically into a single target shape, even 
subject to the chaotic nature of molecules floating randomly in a well-mixed 
solution. The last three decades have seen drastic advancements on the relia- 
bility of DNA tile assembly; in fact, the error rate of 10% per tile in 2004 was 
improved down to 0.13% in 2009 [3J. 

In the aTAM, sticky ends are abstracted to be a glue label, and their 
strengths are assigned by a strength function g. A square tile adheres stably 
to an aggregate of tiles whenever the sum of the strengths of neighboring sides 
with matching labels according to g exceeds a threshold r, which is a system 
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parameter called temperature. "Temperature" in aTAM is rather metaphorical 
than actually describing the physical temperature of the experimental environ- 
ment. It nonetheless can be interpreted as a physical metric for the granularity 
with which different energy levels must be distinguished in order for TASs to 
behave as expected. Certain "behaviors" of TASs were proved to require three 
different energy levels such that the gap between two of them is exponentially 
larger than the gap between other two [5] (for a formal definition of TAS's be- 
havior as strength- free TAS, see Section [3]). Conventionally, the glue strengths 
(and hence, temperature) of TASs are assumed to be integers, without loss of 
generality. Indeed, this is a way of "quantizing" the minimum distinction we 
are willing to make between energies and then re-scalling so that this quantity 
is normalized to 1. 

As mentioned briefly above, the stability of the attachment of a tile at a 
position is determined by the sum of the strengths that tiles at the neighbor- 
ing positions offer via their abutting edges, relative to r. In aTAM, the basic 
building blocks, that is, tiles, are abstracted to be the lxl square so that a tile 
can be adjacent to at most four other tiles. Thus, the sum that determines the 
attachment stability consists of at most four terms. This motivates us to study 
a system of inequalities whose left-hand side consists of at most 4 terms (non- 
negative integers or constants) and whose right-hand side is r. We call such an 
inequality a r-inequality of at most 4 terms; we use this term often with the 
replacement of r by either > T (at least r) or < T (strictly less than r) to specify 
its sign. Then, we can say that r-inequalities of at most 4 terms dominate the 
behavior of a TAS at the micro, or local, level, that is, per attachment event. 
Optimizing (minimizing) r under r-incqualitics is a specific type of integer pro- 
gramming we call threshold programming (TP). In this paper, we will prove that 
TP is NP-hard even under the condition that all > T -inequalities involved be of at 
most 4 terms and all < r -inequalities involved be of at most 3 terms (Lemma [4|. 
This condition makes TPs reducible to the local behavior of a TAS, and this 
implies the NP-hardness of the problem FindOptimalStrength, which aims 
at optimizing the temperature of TASs that behave in a way specified as in- 
put (Theorem [S]) . In other words, FindOptimalStrength cannot be solved 
in a polynomial time, unless P = NP. This NP-hardness complements a result 
by Chen, Doty, and Seki [5], which affirmatively answered a problem posed by 
Adleman et al. [2]. 

The TP instances obtained in this reduction will lead us further to the study 
of macro, terminal, or global, behaviors of TASs. The global behavior simply 
concerns what shape(s) a TAS assembles, and does not mind how its underlying 
attachment events proceed. Well-examined problems on the global behavior of 
TASs includes finding the optimal design of TASs that certainly assembles the 
n x n square for a given n [TJ[2J[TU]. There and also in this paper, the optimality 
of TASs is measured by the number of distinct tile types they contain, and this 
criterion is called the tile complexity. The most important contribution we make 
in this paper along this line is the proposal of a unified framework to convert a 
system S of r-inequalities into a shape S with the property that if S is solvable 
for t = k, then the resulting shape S can be assembled by a temperature-fc TAS 
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of "reasonable size" (Property [IJ . According to this framework, it suffices to 
choose a system of r-inequalities that requires r > k for its solvability in order 
to obtain a shape which prefers the temperature k to those below with respect 
to tile complexity (Theorem [7]) . A choice of another system proves to enable 
this framework prove also that, for any r > 4, it is NP-hard to compute the 
minimum number of tile types required for TASs at a temperature at most r to 
self-assemble the shape (Theorem [8]). 

Current laboratory techniques allow us to handle only at most 2 distinct 
energy levels, that is, temperature 2 (even making a distinction between two 
energy levels is difficult, see, e.g., [5] and references therein, but successful de- 
signs of self-assembling molecular systems at temperature 2 have been reported 
[3 [9]). Therefore, as of this date, we cannot help but interpret our results 
as computational infeasibility of determining whether behaviors of TAS can be 
implemented physically. 

This paper is organized as follows. After the preliminary section, in Sec- 
tion |31 we will formalize the local behavioral equivalence among TASs. The 
section consists of opening paragraphs that introduce the formal definition of 
the equivalence and formalize the problem FindOptimalStrength, being fol- 
lowed by two subsections: Section 13. f I is a preliminary for threshold program- 
ming and makes necessary preparations for the proof of the NP-hardness of 
FindOptimalStrength. Then in Section [32] we present the succession of NP- 
hardness results: quadripartite f-lN-3-SAT (LemmaQ, threshold programming 
(Lemma|4]), and then FindOptimalStrength (Theorem[6]), chained by Karp- 
reductions. Using these results, in Section|4j we will prove the above-mentioned 
theorems on the global behavior of TASs. 

2 Abstract Tile Assembly Model 

This section aims at tersely introducing the reader to the aTAM. An excellent 
tutorial can be found, for instance, in |10j . 

Let £ be an alphabet, and by £*, we denote the set of finite strings over 
S. By Z and N, we denote the set of integers and the set of positive integers, 
respectively, and let No = N U {0}. In aTAM, 1? is especially considered either 
as the two-dimensional integer lattice or as the set of all points on it. 

Given a set of points A C Z 2 on the integer lattice, the full grid graph of A is 
the undirected graph G l A — (V,E), where V = A and for all u, v € V, there is an 
edge between u and v if and only if | |u — v| | 2 = I, where 1 1 • 1 1 2 is the Manhattan 
distance, that is, u and v are adjacent points. A shape is a set SCZ 2 such that 
Gg is connected, and we denote the set of all finite shapes by C V(Z 2 ). Let 
N, W, S, E stand for the respective directions north, west, south, and east, and be 
also interpreted as the respective unit vectors (0, 1), (—1, 0), (0, —I), (I, 0). 

A tile type t is a quadruple (eS'xE'xS'xS*, and is regarded as a unit 
square with four sides listed in the counter-clockwise order starting at the north 
(N), each having a glue label (a.k.a., glue) taken from £*. For each direction 
d £ {N,W, S,E}, let t(d) be the glue label at the d side of t. Let T be a finite 
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set of tile types, and let us denote the (finite) set of all glues of tile types in T 
by A(T) C S*. An assembly (a.k.a., supertile) is a positioning of tiles of types 
in T on (part of) the integer lattice 1? . It does not have to be a tessellation. 
Hence, we can say that an assembly is a partial function 1? T. Given 
two assemblies a, j3 : Z 2 ---> T, a is a sub-assembly of /3, written as a C /3, if 
dom(a) C dom(/3) and for every point p G dom(a), a(p) = /3(p), where dom 
denotes the domain of the function. 

The aTAM models dynamics in the growth of assemblies based on the inter- 
action among its basic building blocks, tiles. A strength function g : A(T) — » No 
endows tiles with an ability to interact with its neighboring tiles by assigning the 
strength g(£) to the matching label I of their abutting edges. If the labels do not 
match or g(£) = 0, these tiles do not interact; otherwise, they do. An assembly 
a and a strength function g induce a binding graph, which is a grid graph whose 
vertices are dom(a) and for two neighboring positions pi,P2 G dom (a), there is 
an edge between p\ and p2 on this graph if and only if the tiles a(pi) and a(jp2) 
interact. On this graph, an edge between vertices means that the corresponding 
tiles interact, and hence, their abutting edges share the same label £. Thus, 
we can consider that the edge is labeled with I and g gives it the weight g(£). 
The assembly is t -stable (with respect to g) if every cut of its binding graph 
has strength at least r. That is, the assembly is r-stable if at least energy r is 
required to separate it into two parts. 

A (seeded) tile assembly system (TAS) is a quadruple T — (T, a, g,r), where 
T and g are as stated above, a : 1? --• * T is a finite r-stable seed assembly, 
and t > 1 is an integer parameter called temperature. TASs are provided with 
inexhaustible supply of copies of each tile type, each copy being referred to as 
a tile. If the seed assembly a consists of a single tile, then T is said to be 
singly- seeded. In this paper, we consider only singly-seeded TASs. 

Given two r-stable assemblies a, (3, we write a —^f ft if a Q and dom(/3) \ 
dom(o;) = {p} for some position peZ 2 . Intuitively, this means that a can grow 
into j3 by the addition of a single tile at the position p. Since /3 is required to 
be r-stable, the new tile is able to bind to a with strength at least r. In this 
case, we say that a 1 '-produces (3 in one step. 

A sequence of r-stable assemblies ao, a\, . . . , otk is a 7" -assembly sequence if 
for all 1 < i < k, a«_i — a,{ holds. We write a — > T (3 and say a T-produces (3 
(in or more steps) if there is a T-assembly sequence ao, ai, . . . , ajt of length 
k = |dom(/3)\dom(a)| with ao = a and ctk — f3. (This definition of producibility 
is justified by our limited focus only onto the finite assemblies in this paper; for 
the infinite assembly, it is not appropriate; see [3] for instance.) An assembly 
a is T -producible or producible by T if o — > T a. A r-stable assembly a is (T- 
jterminal if for any r-stable assembly j3, a — > T (3 implies a = (3. Let A[T] 
be the set of assemblies producible by T, and let -4,n[7~] C A\T] be the set 
of terminal assemblies that are producible by T ■ A TAS T is directed if the 
poset (_4[T],— > r ) is directed, i.e., for each a, (3 € A[T], there exists 7 G A[T] 
such that a — 7 and (3 7. Given a shape S C I?, a TAS T strictly 
(a.k.a., uniquely) (self-)assembles S if the shape of every terminal assembly 
produced by T is S. 
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2.1 Directed tile complexity 

A directed TAS that strictly self-assembles a shape S can be regarded as a "pro- 
gram" to output the shape. A measure of how concisely one can describe such 
a TAS with respect to the number of tile types was introduced by Rothcmund 
and Winfree [TU] in the name of directed tile complexity of S. This complexity 
has been intensely investigated for TASs at the temperature 1 or 2 [2J. The 
temperature- 2 directed tile complexity of S is formally denned as follows: 



C dtilec(2)^ =min J \ T 



T = (T, a, g, 2) is a directed TAS 
that strictly self-assembles S 

This is the minimum number of tile types required for a directed TAS at the 
temperature 2 to strictly self-assemble S. Since any temperature-1 TAS can 
be simulated at the temperature 2 simply by doubling the strength associated 
to each of its labels (see also Proposition [TJ , this definition indeed has already 
taken the temperature-1 TASs into account. We parameterize this complexity 
measure by a temperature r as: 



C dtilcc(r)^ = rxlm j| T | 



T = (T,a,g,r) is a directed TAS 
that strictly self-assembles S 



and introduce it as directed tile complexity of S at the temperature r. 

As mentioned above, C dtilcc W (g) > C dtiloc < 2 ) (S) for any S. Now we show 
that for any temperature r and a positive integer fc, C dtllcc ( T )(5) > C dtaec ( feT ) (£) 
holds. 

Proposition 1. For any r £ N, TASs at the temperature r can be simulated at 
any temperature that is a multiple of r. 

Proof. This simulation is simply done by multiplying the strengths and r of a 
given TAS 7i = (T, a, g, r) by a proper constant c. The TAS thus obtained is 
T 2 = (T, a, g', ct) with g'(£) = cg(£) for each glue label £ in T. □ 

Neither this proposition nor Theorem [7] in Section |4] implies the monoton- 
ically decreasing property of C dtllcc ( r ) (S) , being considered as a function of r. 
Indeed, it is not so as being exemplified at the end of Section |4j This non- 
monotonicity motivates us to introduce the notion of directed tile complexity of 
S at the temperatures at most r. This measure is to be defined as 

C dtilec (<T)(5) = min{C dtilcc(,;) (S') | 1 < i < t}. 

As the TASs 71 and 72 in the proof of Proposition [TJ more than one TAS 
can exhibit identical behaviors at the local (attachment) level in the sense that 
whenever some of the four sides of a tile cooperate for the stable attachment in 
one TAS, a tile of the same type do so in the other TASs, though these TAS may 
be at different temperatures and may assign each label with different strengths. 
As a result, they produce the same shapes. In the next section, we will formalize 
this behavioral equivalence among TASs at the local level, and work problems 
related to the temperature and the parameterized tile complexity that were left 
open in [2j [5] . 
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3 Behavioral Equivalences among TASs 



The "behavior" of a TAS T = (T, a, g, r) is determined fully by its strength 
function g and temperature r. More precisely, g and r do so by specifying the 
local behavior of each tile type t £ T in the form of cooperation set of t with 
respect to g and r 5J, which is defined as: 

2V(*) = C {N,W,S,E} | £ de DS(<(d)) > r}. 

This is the collection of sets of four sides of t whose glues have sufficient strengths 
to bind t cooperatively. By definition, if a set of sides of t is in P ff)T (t), then any 
of its superset is also included in T> g , T (t). Any tile type whose cooperation set is 
empty can be rid from T because a tile of that type never attaches. Combining 
these together, we assume that for any t £ T, {N, W, S, E} £ V g , T (t). 

Cooperation sets provide a behavioral equivalence among TASs. Given Ti = 
(T,a,gi,Ti) and T2 = (T,a, 52,^2) that share the tile set T and seed a, if 
Dgi,Ti(t) — 2}g2,T 2 (*) f° r eacn tile type t £ T, then these TASs are said to be 
locally equivalent (written as 71 ~ T2) 5 . The behaviors of locally equivalent 
TASs are exactly the same at the tile attachment level. This easily leads us to the 
property that a sequence of assemblies ao, ct\, ■ ■ . , afc is a 7I-assembly sequence 
if and only if it is a Trassembly sequence. From this it follows that these TASs 
produce the same assemblies as well as the same terminal assemblies. In short, 
71 ~ T2 implies A[Ti] = A\Vz] and .4rj[7I] = -An [72]- As a corollary, we can see 
that given locally equivalent TASs, one is directed and strictly self- assembles a 
shape if and only if so is and does the other. 

The local equivalence ~ divides the set of all TASs into the equivalence 
classes. All the TASs in a resulting equivalence class behave exactly in the 
same way locally, and hence, we are allowed to use the term "behavior of this 
class." This means that the class can choose a TAS T — (T, a, g, r) it contains 
arbitrarily as representative in describing its behavior by the pair (T, {2? SiT (t) | 
t £ T}). This suggests a way to define a variant of TAS by assigning each t £ T 
with a set of subsets of {N,W, S,E} as a cooperation set. Chen, Doty, and Seki 
introduced this variant called strength-free TAS [S] as it is free from strength 
function or temperature. Formally, a strength-free TAS is a triple (T, a, T>), 
where T and a are the same as those for TAS, while V : T -> P(7>({N, W, S, E})) 
is a function from a tile type t £ T to a set of subsets of {N, W, S, E} that is closed 
under superset operation, where V means the power set. As done between TASs, 
we can define the local equivalence between a TAS (T, cr, g, r) and a strength- 
free TAS T s f = (T,o~,T>) as: they are locally- equivalent if T>(t) = r D g ^ T (t) for 
each t £ T. For an equivalence class, if 7~ s f is locally equivalent to an element of 
the class, then so is it to all of them. This observation allows us to regard the 
strength-free TAS as a representative of (the local behavior of) this class. Of 
particular note is that such a strength-free TAS is unique for each class. It must 
be also noted that there exists a strength-free TAS that does not represent any 
class, that is, that is not locally equivalent to any TAS. This implementability 
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check was formalized as FindStrength in [5], which is defined as follows: 

FindStrength 
INPUT : a strength-free TAS %i 

OUTPUT : a TAS that is locally equivalent to %t, if any, 

or reports that none of such TAS exists otherwise. 

A polynomial-time algorithm for this problem was proposed in [S]. 

The strength-free TAS was introduced as a technical tool to solve an open 
problem posed by Adleman et al. [2], In the paper, they proposed an algorithm 
to find a minimum size directed TAS T — (T, <7, <?, 2) that strictly self-assembles 
the n x n square Sq n , subject to the constraint that the TAS's temperature is 
2. Note that C dtncc ^ (Sq n ) = 0( lQ '°f O g J PQ. This algorithm enumerates all 
temperature-2 TASs with at most C dtllcc ^ (Sq n ) tile types, and checks whether 
each of them is directed and strictly self-assembles Sq n (this is proved to be 
polynomial-time checkable in n). The temperature of a system to be checked 
need not be 2, but rather the temperature-2 restriction^ is utilized to upper- 
bound the number of all candidates to be thus checked by a polynomial in n, and 
as a result, this algorithm runs in a polynomial time in n. The open problem 
was whether the temperature upper-bound could be removed. 

Chen, Doty, and Seki answered this problem positively by designing an algo- 
rithm that runs in polynomial time in n without relying on such an upper-bound 
[5]. Though being based on the above-mentioned idea by Adleman et al. basi- 
cally, their algorithm enumerates strength-free TASs with at most C dtllcc ^ (Sq n ) 
tile types instead of conventional ones, and commits extra check for the imple- 
mentability. This algorithm raised another problem of whether it could be 
modified so as to output among all the minimum size directed TASs for Sq n 
the one(s) working at the lowest temperature. This motivates us to study the 
following optimization: 

FindOptimalStrength 
INPUT : a strength-free TAS T B f 

OUTPUT : a TAS of minimal temperature that is locally equivalent to %f 
if any, or reports that none of such TAS exists otherwise. 

One of the main contributions of this paper is to prove the NP-hardness of this 
problem (Theorem [6]). 

3.1 Threshold Programming 

0-1 integer programming, one of the Karp's 21 NP-complete problems [8], is a 
decision variant of integer programming in which all the variables are restricted 
to be binary. In order to prove the completeness, Karp employed a polynomial- 
time reduction from the Boolean satisfiability problem (Sat) to this problem. 

In order to prove the NP-hardness of FindOptimalStrength, let us intro- 
duce a subclass of integer programming (IP) that aims at optimizing r subject 

1 This can be replaced with the temperature-c restriction for any constant c > 1. 
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to only > T -inequalities and < T -inequalities. We call such an IP a threshold 
programming (TP). This is formalized as: for given integer matrices C\,C2, 
minimize r on condition that there exists a nonnegative integer vector x satis- 
fying 

C\X> rl (> r -inequalities) and C2X < rl (< T -inequalities), 

where 1 is the vector all of whose components are 1 . 

FindOptimalStrength is actually a TP any of whose constraints is either 
a > r -inequality of at most 4 terms or a < r -inequality of at most 3 terms. In 
order to see this, let us consider a simple strength-free TAS with only one tile 
type t = (4,4,4,4) and a cooperation set V{t) = {{N,W, S,E}}. Finding a 
TAS at lowest temperature that is locally equivalent to this strength-free TAS 
is equal to solving the following system of inequalities with a positive integer 
variable r: 

'4+4+4+4 > t 
4+4+4 < r 

4+4 + 4 < r (1) 
£i+4 + 4 < r 
,4+4+4 < r 

so as to minimize r. The minimum r for this system is 4 with 4 — 4 = 4 — 
4 = 1 (with t being strictly less than 4, this system is not solvable). The first 
inequality in (JXJ) corresponds to {N,W, S,E} e T>(t), the second corresponds to 
{N, W, S} $ "D(t), and so forth. Though (fTJ) does not seem to contain an inequality 
corresponding to {N, W} $ T>(t), the second one (or third) actually implies this, 
and hence, not presented for the sake of space. In any case, the system for a 
cooperation set contains at most 15 inequalities. We denote the class of TPs any 
of whose constraints contains at most the number of terms as specified above by 
TP(4, 3). Its decision variant, denoted by t-ThresholdProgramming(4, 3) 
or simply r-TP(4, 3), is of interest in which t is not a variable but a given 
constant, and one is asked to decide whether x exists. 

Our arguments below mainly consist of designing systems of r-inequalities for 
various purposes. As a tool, we introduce a sub-system that will be embedded 
into these systems and force their variables to assume at least or exactly some 
specific value. It is built on the following pair of r-inequalities: 

%l + x a > t and x a < r. (2) 

This pair implies x\ > 1. Once ([2]) being embedded into a r- inequality system, 
the variable x\ cannot help but assume a positive value itself (x a is assumed to 
be an auxiliary variable occurring only in (|5J)). In the rest, when we say that a 
system has a positive variable x, we assume that its positiveness is guaranteed 
in this way. 

Let us build a sub-system called 2 l+1 -adder (to the lower bound of a variable). 
Based on a variable x with a lower bound n, i.e., x > n, it aims at creating 
another variable with a lower bound n + 2 l+1 . Using 5i+5 positive integer 
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variables Zi,Z2,Xo,Xb > x c ,Ak, A' k ,A k> B' k ,B k (l < k < i), we design the 2 i+1 - 
adder as follows: for 2 < j < i, 



A'l- 


f B£ + 




A'{ 




x 


> 


T, 


Ai- 


f Bi + 


Xo, 


Ai 


+ ^' + 


x 


< 


T, 
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+A'p 


Aj- 




+ A» 


> 


T, 


^_ 
Af 






4" 






< 


T, 






zi H 


- x c 




< 


T, 


zi H 






z 2 H 


- x c 




> 


T. 



(3) 



This is actually a modification of a system of inequalities proposed in [5] , and 
as shown there all the inequalities of ([3]) but those on the last two lines imply 

A'! > Ai + 2 i+1 - 2 (4) 

for 1 < k < i. The remaining four inequalities yield z\ > A'l + 1 and Z2 > Z\ + 1 
(in fact, they implement a 2 1 -adder). As a result, z^ > Aj + 2 l+1 . 

Assume that we have a variable a; whose value is at least a positive integer 
n. By setting x — Ai, we can have Ai in the 2'-adder assume not only be 
positive but be at least n\. Then z 2 > n + 2 i+1 . Combining (g]), Z\ > A" + 1, 
zi + .x c < t, Ai > n, and £&, x c > 1 deduces that r must be at least n + 2 !+1 + 1. 
It is important that for any t > n + 2 i+1 + 1, this system can be solved as 
follows: 

Ai = n,x = At = ■■ ■ = = 1, 

A' k = 2 k ,B' k = t — A' k - I, A'l = 2 fc+1 - 1, S^' = r - - 1 for 1 < k < i, 

A' l = n + T - 1, B[ = t - A\ - 1, A'! = n + 2 l+1 - 2, B'l =r- A'l - 1, 

Zi =n + 2 l+1 - l,z 2 = n + 2 i+1 ,x b = r- (n + 2 i+1 - l),x c = r- (n + 2 i+1 ). 

With the fact that any positive number m > 1 can be written as a sum of 
powers of 2, this property makes possible to combine multiple copies of 2 i+1 - 
adders inductively in order to provide a variable with an arbitrarily large lower 
bound. The number of variables involved in the system thus built is at most 

EliT™ 1 ( 5i + 5 ), which is 0((logm) 2 ). 

Concerning ([3]), the 2 l+1 -adder, there are two things to be noted. The 
first is that it consists of r-inequalities of at most three terms. The second 
is that we can divide its variables into four disjoint sets V\, Vz, V3, V4 such that 
each inequality contains at most one variable from each of the four sets. One 
such division is: V\ = {x , Zx, z 2 } U {A 2 k 1 A' 2k , A% k | 1 < k < [i/21}, V 2 = 
{A^A'^A 1 ^ \l<k< Lz/2J}, V 3 = {B j; B' v B>! \l<j< t}U{x b ,x c }, 
and V4 = 0. We say that a system of r-inequalities is quadripartite if 

1. every inequality in it consists of at most four terms; 

2. its variable set can be partitioned into four disjoint subsets so that distinct 
terms of each inequality belong to distinct subsets. 

(The first condition follows from the second and hence not necessary.) Quadri- 
partite systems of r-inequalities will play an essential role in Section [J] 
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3.2 FindOptimalStrength is NP-hard 



Let us prove the NP-hardness of FindOptimalStrength. For the reduction, 
we employ a variant of 3-Sat called monotone I-in-3-Sat introduced by Schae- 
fer in which no literal is negated (monotonicity) and one is required to 
find a truth assignment such that each clause has exactly one true literal. He 
proved its NP-complctcness. We propose its restricted variant called quadri- 
partite 1-in-3-Sat, whose instance consists of a variable set that is a union of 
four pairwise disjoint sets Ui , U2 , U3 , U4 and clauses that contain at most one 
variable from each of these four subsets. 

Lemma 2. Quadripartite I-IN-3-Sat is HP-complete. 

Proof. Let us denote a given instance of monotone I-in-3-Sat by a pair of a set 
U of Boolean variables and a set of clauses of three literals, which are positive 
due to the monotonicity. We will show a polynomial-time reduction from this 
to an instance of quadripartite 1-in-3-Sat. 

The reduction first transforms each clause of the given 1-in-3-Sat instance 
into a quadripartite conjunction of clauses while preserving the I-in-3-Sat sat- 
isfiability, and then conjuncts them. Each conjunction is designed so that it 
admits a partition of its variables into four disjoint sets one of which contains 
all variables of its source clause. The i-th clause {x, y, z} is converted into the 
following conjunction of 13 clauses: 

{-ix, a\, bi}{->y, a 2 , b 2 }{^z, a 3 , h}{a 1 ,a 2 ,a 3 } 

{-■x, c, d x }{^y, c, d y }{^z, c, d z }{c, e xy .f xy }{c, e yz .f yz } (5) 
{c, e zx -fzx } {d x , d y , e xy }{d y , d z , e yz \\ji z , d x , e zx } , 

where all variables but ->x, ->y, ->z are introduced exclusively for this clause. 
This transformation introduces the negated literals ~^x, -iy, -*z but they do not 
cause any problem because after all clauses being converted in this manner, no 
positive literals in the given 1-in-3-Sat instance remains. 

We claim that this preserves the I-in-3-Sat satisfiability. If the i-th clause is 
1-in-3-Sat satisfied, then due to the symmetry among x, y, z in ([5]), it suffices to 
examine the case x = l,y = z = 0. Then this conjunction is I-IN-3-Sat satisfied 
by setting a%,d x , e yzi f xy , f zx to be 1 and the others to be 0. If x = y = z = 0, 
then the first three clauses in ([5J imply a\ = a 2 = 03 = 0, but then the 
fourth clause cannot be satisfied. In the case x = y = 1, we consider two cases 
depending on the value of c. If c = 0, then d x = d y = 1 must hold in order to 
satisfy {-«, c, d x } and {~>y, c, d y }, but then the clause {d x , d y , e xy } contains too 
many true variables. Otherwise, d x = d y = e xy = 0, but then the clause cannot 
be satisfied. Therefore, the clause {x, y, z} is 1-in-3-Sat satisfiable if and only 
if so are all clauses in ([5]). 

The conjunction ([5]) is actually not quadripartite yet, and hence, needs fur- 
ther transformation. We replace its last clause {d z , d x , e zx } using the conversion 

{a,/3,7} = {^a,h,k}{^P,i,k}{^,j,k}{h,i,j}, (6) 
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where h, k are auxiliary variables introduced exclusively for this conversion. 
This conversion preserves the 1-in-3-Sat satisfiability and quadripartite prop- 
erty. A problem is that the resulting conjunction contains both positive and 
negative literals of d Zl d Xl e zx . Thus, to each of the four clauses that replaced 
{d Zl d x , e zx }, the same conversion must apply further. In this way, we finally 
obtain a conjunction of 28 clauses that admits quartering its variables as follows: 

Ui,i = {-ix, -iz, e xy , e yz , e zx }, 

Ui,i = {ai,h,c}, 

Ui.3 = {a,2, b\, d x , d z , f xy , f yz , f zx }, 

Ui,4 = {a 3 ,b 2 ,d y }, 

where the variables introduced via the conversion of last clause are omitted. 
Observe that the (negated) literals of all variables in the given SAT instance is 
in the same set U^\, and that (£7^2 U Ui_3 U 1^,4) n (Uj,2 U tj,3 U ^,4) = f° r 
an y 1 < i < j < n. These two properties immediately bring us the quartering 
of variables occurring in the resulting conjunction as Uk — Ui<i<n ^*,k f° r eacn 
fee {1,2,3,4}. □ 

Theorem 3. For anyr > 4, t-ThresholdProgramming(4, 3) is VP -complete. 

Proof. A proof for r = 4 comes first. An instance of quadripartite^ 1-in-3-Sat, 
whose NP-completeness was proved in Lemma [5J will be reduced into an instance 
of t-TP(4, 3). Let us represent this instance as a pair of a set of Boolean 
variables U = {u±, . . . , u n } and that of clauses C = {ci, . . . , c m }. 

Let us convert this SAT instance into a system S of r-inequalities with 
positive integer variables v%,V2, ■ ■ ■ ,v n (needless to say, ((2|) is used here for their 
positiveness), which correspond to the SAT variables in U, such that the SAT 
instance is satisfiable if and only if the system is solvable. In S, the j-th clause 
of C, Cj = {uj 1 ,Uj 2 ,Uj 3 } with 1 < ji,j2,j3 < n, is represented as 

v h + v n + v h > 4, v h + v n < 4, v n + v j3 < 4, and v n + v h < 4, (7) 

which is equivalent to the equation Vj 1 +Vj 2 +Vj 3 = 4 due to the assumption that 
Vj 1 , Vj 2 , Vj 3 > 1 . Its solution must be that exactly one of the three variables is 2 
and the others are 1. Therefore, if S is solvable, then by interpreting those in 
«!,...,«„ with value 2 be positive and the other (that is, with value 1) negative, 
we can retrieve a way to satisfy the 1-in-3-Sat instance; and vice versa. Thus, 
4-TP(4, 3) is NP-complete (in fact, we proved that even 4-TP(3, 2) is so, see 
0). 

Now the result is generalized for an arbitrary r > 4. In S, we first em- 
bed systems of T-inequalitics of at most three terms, presented in Section 13.11 

2 We have not applied this (simple) conversion directly to the i-th clause. This is because 
the variables involved in the conjunction thus obtained cannot be divided into four sets such 
that one of them contains all of -us, -fy, —<z. 

3 The quadripartite property is not needed here, but will be so in Section l4l 
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that provide auxiliary variables Xi,X2,x T —4 with lower bounds 1, 2, and r— 4, 
respectively. Combining them with an inequality 

Xi + x 2 + x T _ 4 < r (8) 

yields xi = 1, X2 = 2, and ir T -4 = r— 4. This means that by (JSJ being embed- 
ded, in any solution of S, these variables must admit these respective values. 
Especially, the "constant" x T _4 is added to the inequalities in (JT]), and we obtain 

Vj 1 + vj 2 + Vj 3 + x r _ 4 > r, (9) 

v ji + v n + Xt-a < t, (10) 

and the analogs of (|10l) for Vj 1 +Vj 3 and Vj 2 +Vj 3 . Since x r _4 = r — 4, these four 
r-inequalities are equivalent to Vj 1 + Vj 2 + vj 3 = 4. We conclude this proof by 
noting that any r-inequality used is either a > r -inequality of at most 4 terms 
or a < r -inequality of at most 3 terms, and the resulting system of r-inequalities 
is quadripartite. This quadripartite property will become critical in proving 
Theorem |U □ 

Theorem [3] leads us to the NP-hardness of TP. Deleting the constant term 
x T -4 from the inequalities (j5]), ©, and (fTU|) yields the following inequalities: 

x\ + x 2 < t, (11) 

v n + v n + v j 3 > r : v h + v n < T ' v h + v h < T > and v h + Vj 3 < t. (12) 
Consider optimizing r subject to these. Due to (|11[) . the minimal possible value 
of t is 4. Then its optimal value is 4 if and only if the 1-in-3-Sat instance is 
satisfiable. 

Lemma 4. ThresholdProgramming(4, 3) is HP-hard. 

Remark 5. All the systems of r-inequalities designed in this section so far can 
be solved even subject to extra condition that all variables being strictly less 
than r. This extra condition does not prevent the systems from playing their 
intended roles when being embedded. 

Using this instance, now we can prove that FindOptimalStrength is NP- 
hard. Making use of the fact that all of its r-inequalities contain at most 4 terms, 
we transform them into cooperation sets of a strength- free TAS. The inequalities 
(fl"2"|) . which are for the clause Cj, are encoded as t Cj — (vj 11 Vj 2 ,Vj 3 ,x T ) with 
V(t c .) = V({N, W, S, E}) \ {{N}, {W}, {S}, {N, W}, {N, S}, {W, S}}. Note that its east 
side is labeled with an auxiliary variable x r , but this variable does not play 
any essential role but filling the blank side (the tile is not triangle but square). 
Likewise, the inequalities that aim at forcing v±, . . . ,v n be positive are converted. 
The inequalities in ^ are encoded as a tile type ti = (x±, x a , x Tl x T ) with 
V{ti) = P({N,W, S,E}) \ {{N},{W}}. Though being not mentioned in ©, we 
can assume x\ < r as noted in Remark [5j The inequality pip is encoded as 
t' = (xi,x 2 ,x T ,x T ) with V(t') = P({N,W, S,E})\ {{N},{W}, {N,W}}. Then, there 
exists a TAS at temperature 4 that is locally equivalent to this strength-free 
TAS if and only if the given instance of 1-in-3-Sat is satisfiable. 

Theorem 6. FindOptimalStrength is HP-hard. 
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4 Tile complexity at high temperatures 



In Section |3l bridges from 1-in-3-Sat to TP(4, 3) and further to the local, or 
micro, behavior of a TAS have been established. Since the study of TAS aims 
at facilitating the design of nano-scale structures, we should shift our focus onto 
their global (or macro, terminal) behavior; how a given shape is built by TASs. 
Problems of interest include: for any temperature r given as parameter, 

1. Is there a shape S T that prefers the temperatures above r to the lower 
ones in terms of tile complexity; that is, C dtiloc « T )(SV) > C dtilcc M (5 r )? 
If so, then can we design an algorithm to construct S T ? 

2. Can we compute the directed tile complexity of a shape S at the temper- 
atures below r, that is, C dtllec (- T - ) (S'), in a polynomial time? 

For r = 2, these problems have been studied intensively. Adleman et al. proved 
that C dtile < 2 \Sq n ) = O( lo gf gj for any nxn square Sq n pQ. In contrast, 
Qdtilec(i) (Sq n ) < 2n — 1 and this bound is conjectured to be tight [2], which is 
highly probable. Thus, C dtilc « 2 \Sq n ) > C itilec ^(Sq n ), provided the conjec- 
ture is true. As for the second problem, it is NP-hard to compute the directed 
tile complexity at the temperatures at most 2 [2J. 

We will work on these problems without any constraint on temperature, and 
answer them by proving the following two theorems. 

Theorem 7. For any t > 2, there is a shape S T whose directed tile complexity 
is strictly lower at the temperature t than at any temperatures below t — 1 . 

Theorem 8. For any r > 4, it is NP-hard to compute the directed tile complexity 
of a shape at the temperatures below t. 

Theorem [7] is a positive answer to the first problem. Based on the results 
obtained on TP (4, 3), we propose a design of a shape S T for a given r > 2 that 
prefers the temperature r to the ones strictly below in terms of tile complexity; 
that is, C dtilcc « r >(SV) > C dtilec ( T )(5' T ). This should be, as of now, interpreted 
as an infcasibility result that the arbitrarily fine control of binding energies 
is necessitated to design TASs of "reasonable size" for a certain shape in a 
laboratory, which has not yet been realized, at least to my knowledge. 

Not only to this end but this design also makes possible to convert the 
instance of 1-in-3-Sat into a shape S and a constant c such that the instance 
is satisfiable if and only if there is a directed TAS of at most c tile types that 
strictly self-assembles 5 at a temperature below r for an arbitrary r > 4; that 
is, C dtllcc< -- r - l (S') < c. This amounts to the proof of Theorem which answers 
the second problem unless P = NP. Adleman et al. proved the analogous result 
for t = 2 2]. The case r = 3 remains open. This gap must be filled, but our 
proof cannot be applied to this case, at least directly. It is probably essential to 
transform an instance of 3-Sat into finely-crafted gadgets for this case as done 
by Adleman et al. in [2] for the case r = 2. This paper leaves this case open. 

We propose the following unified approach to various problems related to the 
behaviors and temperatures of directed TASs including the above two problems: 
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Figure 1: A logical component for a > r -incquality of four terms (left) and four 
variable trees (red, green, blue, and yellow from the left) that correspond to the 
four variables in the inequality. The positions numbered 1, 2, 3, and 4 are the 
cooperation tip of these trees, respectively. The logical component consists of a 
single gray tile at the position numbered 5 and four pillars that are of the shape 
identical to the four respective variable trees. 

Step 1. choose a quadripartite system S of r-inequalities properly for the pur- 
pose among those built in Section [3J hence, all > r -inequalities of S are 
of at most 4 terms and all of its < T -inequalities are of at most 3 terms; 

Step 2. convert its variables VxiV%, . . . ,v n into trees of height h and distinct 
shape, which we call variable trees, where h is a parameter adjustable 
for our convenience; 

Step 3. for each r-inequality, bundle the (at most 4) trees thus converted from 
its variables into a shape called the logical component] 

Step 4. (two copies of) the n variable trees and logical components each are 
mounted next to each other onto a scaffold, and amounts to a shape, 
which we denote by S. 

Examples of variable trees and a logical component are shown in Figure [TJ Note 
that S is parameterized by h. The design of the shape S through these four 
steps will bring a constant c h with the following property after its parameter 
h being made large enough. 

Property 1. Any directed TAS T = (T,a,g,r) that strictly self-assembles S 
using at most (n + c')h + c tile types has (not-necessarily-distinct) n glue labels 
£i,-^2i • ■ • An such that 

• for any m < 4 and ki, . ■ . , k m € {1, . . . , n}, if S includes the > r -inequality 
Ei<i<m% ^ T > then Ei<i< m s(4J > t; 

• for any m < 3 and ki,...,k m i G {1, ...,n}, if S includes the < T - 
inequality £i<j<m' v kj < r, then Ei<j<m' s(4 3 ) < r, 
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where d is the number of < T -inequalities in the system S chosen at Step 1. 

This property should be interpreted that the way for the function g of such a 
"small" T at the temperature r, if any, to assign labels with glue strengths tells 
how to satisfy the given system S of r-inequality. Conversely speaking, unless 
the given system admits a solution with r being some specific value r', any 
directed TAS at the temperature r' needs strictly more than (n + c')h + c tile 
typetQ in order to strictly self-assemble S. Verifying this property amounts to 
proofs of Theorems [7] and 13 it is sufficient for us to choose a proper S at Step 1. 
In order to prove Theorem [71 for k > 2, we choose the system of r-inequalities 
that cannot be solved for any r < k but can for any r > k. This is the one 
designed in Section IQ1 based on the 2* +1 -adders. It is transformed into a shape 
S through Steps 2-4. This amounts to a proof of a slightly-stronger version of 
Theorem [Jj As for Theorem the TP instance built for Theorem [3] is rather 
chosen. In the rest of this section, therefore, we just have to verify Property Q] 
in order to complete the proof of these theorems. 

Let V — {vi, t>2, . . . , v n } be the set of variables occurring in the system S 
chosen at Step 1. As mentioned in Remark [SJ we can assume that every variable 
in V is strictly less than r; this assumption simplifies the design of S and our 
explanation below. The more essential is that S is quadripartite, that is to say, 
the variable set V can be partitioned into four disjoint subsets V N , Vy, V s , V E . 

Step 2 

In this step, we convert each variable in V into one of the four tree shapes 
illustrated in Figure [1] depending on which of Vn, Vw, Vs, Ve it belongs to. For 
instance, each variable in Vn is converted into the shape whose tip is numbered 
1 (and colored red). This shape is a tree of height h plus some constant, not 
depending on h, with only one crotch for two branches; one consists of a bit 
pattern and the other is of size 1 for cooperation called cooperation tip, which 
is numbered 1 in Figure [TJ We say that this shape is of north type after the 
subscript of Vs. Shapes of north type that are thus converted from distinct 
variables (in Vjj) are identical mod their bit patterns of length [log n \ . In this 
way, each variable in the other three variable subsets Vy, V$, Vs is also converted 
into the shapes that are numbered 2, 3, 4 (and colored yellow, green, blue) in 
Figure [TJ respectively, and we say that these shapes are respectively of west, 
south, and east type. All the n variables of V have been now associated with n 
different tree shapes of proper type. We collectively refer to them as variable 
trees. Two copies of each variable tree is mounted onto a scaffold. 

Every conversion of < T -inequalities in S into logical components at Step 3 
needs to introduce an auxiliary tree of the north, west, south, or east type 
and recall that S was assumed to contain d of them (the conversion of > T - 
inequalities does not). These trees are distinguished from each other and from 
the variable trees by their bit patterns. Two copies of each auxiliary tree are 

4 Actually, it will be proved necessary for T to have at least (n + c' + l)h tile types, which 
is much larger than (n + c')h + c because h^> C2- 
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mounted next to each other on the scaffold of S. We will call variable trees and 
auxiliary trees simply trees unless confusion arises. 

We claim that T needs at least (n + c')h tile types for T to strictly self- 
assemble these n + d trees mainly using the results by Adleman et al. in [5] on 
tile complexity of trees. Being duplicated, each tree has its copy that assembles 
upward from the scaffold (actually, all trees but at most one do so since T is 
assumed to be singly-seeded). Theorem 4.3 in [2] implies that in order for T to 
strictly self-assemble two variable trees of different types (e.g., north and west), 
at least 2h plus some constant number of tile types are necessary; h exclusively 
for each, no matter how and at what temperature T is designed, where the 
constant is much smaller than h. This is not to say that T could not reuse any 
tile type for both of them. It can, for example, for their bit patterns. However, 
the number of such reusable tile types is bounded by a constant that is much 
smaller than h. This lower bound exists also for two trees of identical type but 
with distinct bit patterns. It is not allowed for T to put tiles of identical type at 
the crotch positions because if they were the same, incorrect bit patterns could 
assemble. In contrast, T is able to reuse tiles of same type for the cooperation 
tips of these trees, but anyway it saves only one tile type. This observation 
is immediately generalized for more variable and auxiliary trees as: in order 
for T to assemble the n distinct variable trees and d auxiliary trees, at least 
(n + d)h + c" tile types are necessary for some constant c" <C h. In brief, 

C dtilcc(r) (5) > ( n + d)h + c" 

at any temperature r. 

Before proceeding to the explanation of next step, we prove that T must put 
the same tile type at the cooperation tip positions of two copies of each variable 
tree or auxiliary tree. This is related to the position of the seed of T so that 
first we see that it merely costs the number of tile types for T to put its head 
on a variable tree or an auxiliary tree. Suppose T put its seed on one copy of 
some variable tree, and let us compare it with the other copy of the same tree. 
Recall that they are located next to each other. In order to start assembling the 
seed-free copy, T first must assemble downward from the seed and the scaffold 
between the copies. Then consider the process for T to assemble the seed-free 
copy upward up to the counterpart position of the seed. During this process, T 
cannot reuse any tile type used for the part between the seed and scaffold; such 
a recycle would enable T to skip the assembly of the seed-free copy and produce 
a shape with only one copy of the variable tree. Thus, even if the seed is on 
a copy of some variable tree, it could not be located "far from" the scaffold if 
it were not for extra n tile types available for T. This means that T needs to 
assemble almost all parts of these trees upward, and hence, it would impose the 
unaffordablc number of extra tile types on T to put tiles of distinct types at the 
cooperation tips of two copies of a tree. Now, for 1 < i < n, we can denote the 
tile type that T puts at the both cooperation tips of the copies of the variable 
tree for Vi by ti. The label ti mentioned in Property Q] is found at the side of ti 
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opposite to the crotch. That is, 



'U(S) ifVi€V s 
= I U(E) if Vi e V v 
*"U(n) ii Vl eV s 
^i(W) ifuiSV^. 

Step 3 

In this step, we convert > r -inequalities and < r -inequalities of the given system 
S into shapes which we call logical components. Two of their copies will be 
mounted in Step 4 onto the scaffold, which will complete the design of the 
shape S. 

Let us begin with a simpler one: the logical component for > r -incquality (of 
at most 4 terms). Recall that any > r -inequality in S is of at most 4 terms and 
none of its two terms are variables taken from the same variable subset. That is 
to say, the variable trees that its terms (variables) correspond to are of pairwise 
distinct type, and hence, can be bundled together so as for their cooperation 
tips to be adjacent to one position (see Figured! where the position is numbered 
5). The logical component for this > T -inequality consists of the variable trees 
thus positioned, which we refer to as pillars, and the position adjacent to all the 
cooperation tips of the pillars. In this manner, each > r -inequality is converted 
into a logical component and its two copies are mounted onto the scaffold of S. 

Proof of the first item of Property [7] 

Now we will prove the first item of Property[T] that is, if the > T -incquality 

E ( 13 ) 

1 < 2 < m 

belongs to 5, then 

£ 2(4,) >t, (14) 

l<i<m 

where m < 4 and ki, . . . , k m E {1, . . . ,n}. Before that, however, let us 
quickly show that if (fT4|) holds, then T can assemble the logical com- 
ponent for (fT3|) by introducing only one new tile type in the following 
manner. T first assembles the respective parts of the component that cor- 
respond to the variable trees for , ■ • ■ , Vk m as done for the copies of these 
trees (hence, costs nothing) and lets their cooperation tips to cooperate to 
attach a tile of the new type to fill the position surrounded by the m co- 
operation tips, which is numbered 5 in the logical component in Figure [TJ 
This cooperation provides enough strength for stable tile attachment due 
to (fT4")l . Let us return to the proof. For the sake of contradiction, suppose 
(fl"4"|) does not hold. Focus on the seed-free copy of the logical component 
for (|13p for the concise argumentation. Since this copy is free from seed, 
T needs to assemble at least one pillar of the component upward from the 
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Figure 2: A logical component for a < r -inequality (of 3 terms). The trees of 
north, west, east type are variable trees (red, yellow, blue), while the remaining 
tree of south type is an auxiliary one (white). The trees are written concisely 
for clarity, but they should be of the same shape as illustrated in Figure [TJ 

scaffold. Under the supposition that (fl4|) did not hold, T is not allowed 
to assemble these upward-growing pillars as done for the corresponding 
variable trees without any extra tile types. More precisely, at the cooper- 
ation tip of at least one of these pillars, say the one for Vkt , T must put 
a tile of different type from . However, this costs extra at least h tile 
types because r cannot reuse any tile types for the tree for in order to 
assemble this pillar, or any tile types for the any other tree Ufc. because 
T was proved not to be able to assemble and using strictly less 
than 2h tile types (this argument works in order to prove that this pillar 
cannot assemble using tile types for auxiliary trees). □ 

The design of logical components for < T -inequalities (of at most 3 terms) 
is more involved. One thing to be noted first is that this cannot be realized 
simply by modifying the above-mentioned component design for > T -inequalities 
by leaving the position adjacent to all the cooperation tips of the pillars empty. 
This is because no attachment may not mean the insufficient strength but label 
mismatch. We propose the following design instead. 

Let us consider a < r -inequality: 



for some m' < 3 and fci, . . . , k m ' S {1, . . . , n}. In addition to the m! variable 
trees for , . . . , Vk , , we employ one of the c' auxiliary trees prepared in Step 2 
that is of distinct type from any of the variable trees (recall that d is the 
number of < T -inequalities in S). Then we combine copies of at least m' of these 
m' + 1 trees as done in building the components for > r -inequalities but in all 
combinations of m! trees. The combination of all the m' + 1 trees is turned 
into a "gadget" by filling the position that are adjacent to all the cooperation 
tips, whereas the other combinations are considered to be gadgets without any 
modification. These m' + 2 gadgets amount to the logical component for the < T - 
inequality (|15j). and two copies of this component are mounted on the scaffold 




(15) 



1<j <m ; 
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of S. Note that, once being used, the auxiliary tree will not reused for the 
component of other < T -inequalities any more. 

Proof of the second item of Property^ 

Now we will prove that if (fT5|) holds, then 

E 9(W<r. (16) 

l<j<m' 

Before this proof, let us note that since the auxiliary tree does not appear 
in any other component, one can have g assign an arbitrary strength to the 
glue at the cooperation tip of this tree without causing any malfunctioning 
of the other component, as long as (fT6)l holds. Setting this strength to be 
r — J2i<j< m ' v kj enables T to assemble this component with introducing 
only one new tile type that fills the position adjacent to the cooperation 
tips. 

For the sake of contradiction, suppose that the sum J2i<j< m ' 9(^j) were 
at least r. Focus on the seed-free copy of this component, and especially 
first on the gadget with ml + 1 pillars, with the position surrouned by 
the cooperation tips being filled (the leftmost gadget in Figure [5]). Let us 
denote this gadget by G\. Due to the seed-freeness, some of the pillars 
of G\ assemble upward and have a tile t fill the position by cooperative 
attachment. Assume that some pillar is absent from the cooperation. Let 
us denote the gadget that does not contain the "lazy" pillar by G2. Then 
some of the tiles put at the cooperation tips of G2 must be distinct from 
those at the corresponding positions of G\ in type in order to prevent 
the tile t from attaching to G2. However, this distinctiveness would cost 
T extra h new tile types, as explained in the proof of the first item. 
Thus, all of the pillars of G\ must join the cooperation, and hence, all 
the pillars of G\ assemble upward. Then consider the gadget that is free 
from the auxiliary pillar, which we denote by G3. The types of tiles at the 
corresponding cooperation tips of G\ and G3 must be the same in order 
for T not to pay the unaffordable extra h tile types. However, then there 
must exist 1 < j < ml such that the tile tj at the cooperation tip of the 
tree for v j and the tile at the corresponding position in Gi must differ 
from each other in type because of the supposition J2i<j< m ' di^-i) — T - 
This again causes the unaffordable cost. Consequently, we can say that 
(fT6|) must hold. □ 

As the readers might have already noticed, the constant c is the numbers of 
tile types for the scaffold (actually, equal to its length, which depends only on 
the number of variables and the number of inequalities in S) plus the number 
of tile types for each variable tree that exceeds h (top structures differ tree by 
tree, but a bit). 

When being introduced, the directed tile complexity at the temperatures 
at most r was noted not to be a monotonically decreasing function. Let us 
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conclude this section and hence this paper by its proof. Since we do not find 
this non-monotonicity significant in practice, we simply exemplify it and will 
not inquire deeper into the matter. Actually, this proof is a good opportunity 
to show the usefulness of the unified framework we proposed previously. 

Lemma 9. There exists a shape S such that C dtilec ( 3 )(S') < C dtilcc ( 4 )(S'). 

Proof. As explained into details when being introduced, the framework needs 
only our appropriate choice of a system of r-inequalities at Step 1. Actually, 
it suffices to find as such a system the one that is solvable for r = 3 but not 
for t = 4. Such an equation, however, was already given as (H) in Section |3~T1 
Needless to say, all > T -inequalities and all < r -inequalities in this system are of 
at most 4 and 3 terms, respectively, and this system is quadripartite. □ 

A problem of theoretical interest is whether for any temperature r, there 
exists a shape S such that C dtllec ^ £ - ) (S) is strictly decreasing over the interval 
[1, r]. The framework cannot be applied, or even if it can, some modification is 
necessary. 

5 Conclusions 

In this paper, we continued research on the behavioral equivalence among TASs 
at the local level that was initiated in [5] and solved their open problems of 
whether minimizing the temperature of TASs that behave locally as specified 
as input can be solved in a polynomial time by proving that this problem is NP- 
hard. We deduced this from our other result that the threshold programming, a 
special form of integer programming, is still NP-hard. Furthermore, we proposed 
a unified framework to work on various problems related to the temperature 
and the global behavior of TASs, which makes use of systems of r-inequalities 
designed for the proof of the above results. 

There are several directions for further research. The NP-hardness stated in 
Theorem [5] never eliminates the possibility to improve the algorithm by Chen, 
Doty, and Seki so as to optimize the size and temperature of TASs for the n x n 
square Sq n simultaneously. Elucidating this will deepen our knowledge of the 
tile complexity of Sq n further. As for Theorem [3 our interest lies on the ratio 
C dtUec«T)^ T y C dtilec(T)^ T ^ In our construction of S T , the ratio is constant 

no matter how h is adjusted. In contrast, C dtilcc( - 1 \Sq n )/C dtilc < 2 '> (Sq n ) can get 
arbitrarily large by making n larger, provided the conjecture C dtllcc ^ (Sq n ) = 
2n — 1 is true (or even C dtllec ^ (Sq n ) = f2(n) is fine). Can we find such a shape 
for arbitrary temperature? With Theorem[8]and the result by Adleman et al. [2], 
the NP-hardness of computing directed tile complexity at the temperatures below 
3 must be proved. 
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